Display apparatus, voice acquiring apparatus and voice recognition method thereof

ABSTRACT

Disclosed are a display apparatus, a voice acquiring apparatus and a voice recognition method thereof, the display apparatus including: a display unit which displays an image; a communication unit which communicates with a plurality of external apparatuses; and a controller which includes a voice recognition engine to recognize a user&#39;s voice, receives a voice signal from a voice acquiring unit, and controls the communication unit to receive candidate instruction words from at least one of the plurality of external apparatuses to recognize the received voice signal.

CROSS-REFERENCE TO RELATED APPLICATION

This is a continuation application of U.S. patent application Ser. No.14/076,361 filed Nov. 11, 2013, in the U.S. Patent and Trademark Office,which claims priority from Korean Patent Application No.10-2012-0126650, filed on Nov. 9, 2012, in the Korean IntellectualProperty Office, the disclosures of which are incorporated herein byreference in their entireties.

BACKGROUND

Field

Apparatuses and methods consistent with the exemplary embodiments relateto a display apparatus, a voice acquiring apparatus and a voicerecognition method thereof, and more particularly, to a displayapparatus, a voice acquiring apparatus and a voice recognition methodthereof which recognizes a user's voice.

Description of the Related Art

A voice recognition function is used in various electronic apparatusessuch as a digital television (TV), an air conditioner, a home theater, apersonal computer (PC), and a mobile phone, etc.

To perform the voice recognition function, a main apparatus such as a TVshould have a microphone to receive a user's voice and a voicerecognition engine to recognize the input voice, and the voicerecognition engine may compare the input voice with a stored candidateinstruction words, and recognize the voice according to a result ofcomparison.

However, the related art electronic apparatus which has the voicerecognition function has a fixed means to receive the user's voice, andthus is difficult to utilize various input means such as a mobile phoneinputting voice. Also, if many candidate instruction words are provided,a rate of recognition would be increased, but the electronic apparatusshould compare the candidate instruction words, resulting in a slowervoice recognition processing speed. Further, as the storage capacity ofthe main apparatus is limited, the number of the candidate instructionwords may not be increased continuously.

SUMMARY

According to an aspect of an exemplary embodiment, there is provided adisplay apparatus including: a display unit which displays an imagethereon; a communication unit which communicates with a plurality ofexternal apparatuses; and a controller which includes a voicerecognition engine to recognize a user's voice, receives a voice signalfrom a voice acquiring unit, and controls the communication unit toreceive candidate instruction words from at least one of the pluralityof external apparatuses to recognize the received voice signal.

A plurality of voice acquiring units may be provided. If a voice inputis detected to at least one of the plurality of voice acquiring units,the controller may receive a voice signal from the voice acquiring unitto which the voice input is detected.

The voice acquiring unit may include at least one of a built-inmicrophone provided in the display apparatus, a first externalmicrophone provided in at least one of the plurality of externalapparatuses, and a second external microphone different from thebuilt-in microphone and the first external microphone.

The external apparatus may include at least one application which maymanage the candidate instruction words.

The display apparatus may further include a native application whichmanages the candidate instruction words.

The display apparatus may further include a storage unit which storesthe received candidate instruction words therein, and the voicerecognition engine may recognize the received voice by using the storedcandidate instruction words.

If the at least one of the plurality of voice acquiring units detects awakeup keyword, the controller may enable the voice acquiring unit whichdetects the wakeup keyword, and receive a voice signal from the enabledvoice acquiring unit.

If a trigger signal is input by a manipulation of a predetermined buttonprovided in one of the plurality of voice acquiring units, thecontroller may enable the voice acquiring unit by which the triggersignal is input, and receive a voice signal from the enabled voiceacquiring unit.

The controller may control the display unit to display thereon voicerecognition results for the voice signal and the candidate instructionwords corresponding to the voice recognition results.

The display unit may display thereon information on an application whichmanages the candidate instruction words.

The voice recognition engine may recognize the voice by deciding aninstruction word that is identical to or similar to the received voicesignal, among the received candidate instruction words.

According to an aspect of another exemplary embodiment, there isprovided a voice acquiring apparatus including: a communication unitwhich communicates with a display apparatus having a voice recognitionfunction; a voice acquiring unit which receives a user's voice; a voiceconverter which converts the received voice into an electric voicesignal; and a controller which controls the communication unit totransmit the converted voice signal and candidate instruction words tothe display apparatus to recognize the voice signal.

The voice acquiring apparatus may further include at least oneapplication which may manage the candidate instruction words.

According to an aspect of another exemplary embodiment, there isprovided a voice recognition method of a display apparatus including:receiving a voice signal from a voice acquiring unit; receivingcandidate instruction words from at least one of a plurality of externalapparatuses to recognize the received voice signal; and recognizing auser's voice according to the received voice signal and the candidateinstruction words.

The voice recognition method may further include detecting a voice inputto at least one of a plurality of voice acquiring units, and thereceiving the voice signal may include receiving the voice signal fromthe voice acquiring unit to which the voice input is detected.

The voice acquiring unit may include at least one of a built-inmicrophone provided in the display apparatus, a first externalmicrophone provided in at least one of the plurality of externalapparatuses, and a second external microphone provided in an apparatusdifferent from the display apparatus and the plurality of externalapparatuses.

The external apparatus may include at least one application whichmanages the candidate instruction words.

The display apparatus may include a native application which manages thecandidate instruction words.

The voice recognition method may further include storing the receivedcandidate instruction words, and the recognizing the voice may includerecognizing the voice by using the stored candidate instruction words.

The detecting the voice input may include detecting a wakeup keyword toone of the plurality of voice acquiring units, and enabling the voiceacquiring unit that detects the wakeup keyword.

The detecting the voice input may include detecting an input of atrigger signal according to a manipulation of a predetermined buttonprovided in one of the plurality of voice acquiring units, and enablingthe voice acquiring unit by which the trigger signal is input.

The voice recognition method may further include displaying voicerecognition results for the voice signal and candidate instruction wordscorresponding to the voice recognition results.

The displaying may include displaying information on an application thatmanages the candidate instruction words.

The recognizing the voice may include recognizing the voice by decidingan instruction word that is identical to or similar to the receivedvoice signal, among the received candidate instruction words.

BRIEF DESCRIPTION OF THE DRAWINGS

The above and/or other aspects will become apparent and more readilyappreciated from the following description of the exemplary embodimentswith reference to the accompanying drawings, in which:

FIG. 1 illustrates an example of a voice recognition system according toan exemplary embodiment;

FIG. 2 is a block diagram of the voice recognition system according toan exemplary embodiment;

FIG. 3 illustrates an example of performance of voice recognitionaccording to an exemplary embodiment;

FIG. 4 illustrates an example of a screen which is displayed as a resultof the voice recognition in FIG. 3;

FIG. 5 illustrates an example of performance of voice recognitionaccording to another exemplary embodiment;

FIG. 6 is a flowchart showing a voice recognition method of the voicerecognition system according to an exemplary embodiment;

FIG. 7 is a flowchart showing details of a process of detecting a voiceinput in FIG. 6; and

FIG. 8 is a flowchart showing details of a process of performing voicerecognition in FIG. 6.

DETAILED DESCRIPTION OF EXEMPLARY EMBODIMENTS

Below, exemplary embodiments will be described in detail with referenceto accompanying drawings. The exemplary embodiments may be embodied invarious forms without being limited to the exemplary embodiments setforth herein. Descriptions of well-known parts are omitted for clarity,and like reference numerals refer to like elements throughout.

FIG. 1 illustrates an example of a voice recognition system according toan exemplary embodiment.

As shown in FIG. 1, the voice recognition system includes a mainapparatus 100, a plurality of voice acquiring apparatuses 201 and 202,and a plurality of external apparatuses 301, 302 and 303. The mainapparatus 100, the plurality of voice acquiring apparatuses 201 and 202,and the plurality of external apparatuses 301, 302 and 303 are connectedto one another for mutual communication.

The main apparatus 100 includes a voice acquiring unit 140 such as amicrophone to receive a user's voice, and a voice recognition engine 181to recognize the input voice and to communicate with the plurality ofvoice acquiring apparatuses 201 and 202 and the plurality of externalapparatuses 301, 302 and 303 through a communication unit 160. The mainapparatus 100 further includes native applications 171 and 172, whichare driven for the main apparatus 100 to perform various functions(services). The native applications 171 and 172 store in advance thereincandidate instruction words corresponding to the functions. That is, thenative applications 171 and 172 are included in available servicescenario. The candidate instruction words stored in the nativeapplications 171 and 172 are transmitted to the voice recognition engine181 at the time of voice recognition to enable the voice recognitionengine 181 to perform voice recognition.

Each of the plurality of voice acquiring apparatuses 201 and 202 mayinclude a voice acquiring unit such as a microphone to receive a user'svoice, and a voice signal corresponding to the received voice istransmitted to the main apparatus 100 for voice recognition.

The plurality of voice acquiring apparatuses 201 and 202 may receive auser's voice, convert the voice into an electric voice signal, andtransmit the electric voice signal to the main apparatus 100. Theplurality of voice acquiring apparatuses 201 and 202 may perform awireless communication with the main apparatus 100. While not limitedthereto, the wireless communication includes a wireless LAN, a radiofrequency (RF) communication, a Bluetooth, Zigbee, an infrared (IR)communication, etc.

The plurality of external apparatuses 301, 302 and 303 may include atleast one dev. Application to perform functions (services) as needed.The dev. Application stores in advance therein candidate instructionwords corresponding to the functions performed by the externalapparatuses 301, 302 and 303. The candidate instruction words commandsstored in the dev. Application are transmitted to the voice recognitionengine 181 at the time of voice recognition to enable the voicerecognition engine 181 to perform voice recognition.

The candidate instruction words that are stored in the nativeapplications 171 and 172 and in the dev. Application in advance may beinstruction words related to functions/operations of the applications.For example, if the main apparatus 100 is a TV, candidate instructionwords related to a change of channel, an adjustment of volume, etc. ofthe TV may be stored in one of the native applications 171 and 172. Ifthe external apparatus 302 is an air conditioner, candidate instructionwords related to an adjustment of temperature (up/down), an adjustmentof intensity of wind (strong/weak/moderate), etc. of the air conditionermay be stored in the application included in the external apparatus 302.

The external apparatus or the voice acquiring apparatus may include boththe voice acquiring unit and the dev. application. In this case, if avoice is input to the voice acquiring unit in the first externalapparatus 301, the candidate instruction words stored in advance in thedev. application of the first external apparatus 301 are transmitted tothe voice recognition engine 181 of the main apparatus 100 to performvoice recognition.

The voice recognition system according to the exemplary embodimentincludes at least one voice acquiring unit. If the voice input to thevoice acquiring unit is detected, the voice recognition system receivesa voice stream by enabling the voice acquiring unit to which the voiceinput has been detected. If a plurality of voice acquiring units isprovided, the voice recognition system may receive the voice stream byenabling the voice acquiring unit to which the voice input has beendetected, among the plurality of voice acquiring units. The plurality ofvoice acquiring units may include a built-in microphone provided in themain apparatus 100, a first external microphone provided in at least oneof the plurality of external apparatuses 301, 302 and 303, and a secondexternal microphone provided in the voice acquiring apparatuses 201 and202 which are different from the main apparatus 100 and the plurality ofexternal apparatuses 301, 302 and 303. The voice acquiring apparatuses201 and 202 are separated from the main apparatus 100 and the pluralityof external apparatuses 301, 302 and 303.

If the at least one of the plurality of voice acquiring units detects awakeup keyword, the main apparatus 100 may enable the voice acquiringunit by which the wakeup keyword is detected, and receive a voice signalfrom the enabled voice acquiring unit. If a trigger signal is input by amanipulation of a predetermined button (e.g., an occurrence of an event)in the at least one of the plurality of voice acquiring units, the mainapparatus 100 may enable the voice acquiring unit by which the inputtrigger signal is input and receive the voice signal from the enabledvoice acquiring unit.

The main apparatus 100 may operate in a voice recognition mode. If theat least one voice acquiring unit is enabled by the wakeup keyword orthe trigger signal, the main apparatus 100 may disable other voiceacquiring units to prevent an occurrence of error in voice recognition.The main apparatus 100 may operate in a distant or adjacent voicerecognition mode. The main apparatus 100 may display a user interface(UI) showing the voice acquiring unit connected to a display unit 130(to be described later) for user's convenience.

The main apparatus 100 may receive candidate instruction words from theat least one of the plurality of external apparatuses 301, 302 and 303to recognize the received voice signal. The received candidateinstruction words may be transmitted to the voice recognition engine 181for voice recognition.

The plurality of external apparatuses 301, 302 and 303 include at leastone application which manages the candidate instruction words. The mainapparatus 100 includes native applications 171 and 172, which manage thecandidate instruction words. The candidate instruction words managed bythe native applications 171 and 172 may be transmitted to the voicerecognition engine 181 for voice recognition.

The main apparatus 100 may be implemented as a display apparatus such asa television (TV) as in FIG. 2.

FIG. 2 is a block diagram of the voice recognition system according toan exemplary embodiment.

The display apparatus 100 processes an image signal from an externalimage supply source (not shown) to display an image based on theprocessed image signal.

In the voice recognition system according to the exemplary embodiment,the display apparatus 100 is implemented as the TV or a set-top boxwhich processes a broadcasting image based on broadcastingsignals/broadcasting information/broadcasting data transmitted from abroadcasting station. However, it is understood that in one or moreother exemplary embodiments, the display apparatus 100 may apply tovarious other devices which process and display an image, in addition tothe TV or the set-top box. For example, the display apparatus 100 mayinclude a personal computer (PC), a laptop computer, etc.

Further, it is understood that the type of an image which is displayableby the display apparatus 100 is not limited to the broadcasting image.For example, the display apparatus 100 may display, e.g., a video, astill image, applications, an on screen display (OSD), a graphic userinterface (GUI) to control various operations, based on signals/datatransmitted by various image supply sources (not shown).

According to an exemplary embodiment, the display apparatus 100 may beimplemented as a smart TV. The smart TV may receive and display abroadcasting signal in real-time, have a web browser function to displaythe broadcasting signal in real-time and to search various contentsthrough an Internet, and provide a convenient user environment to do theforegoing. The smart TV may include an open software platform to providea user with an interactive service, and may provide the user withvarious contents through the open software platform, e.g., anapplication providing a predetermined service. The application mayprovide various types of services, e.g., SNS, finance, news, weather,maps, music, movies, games, e-books, etc.

The display apparatus 100 includes the voice recognition engine 181 torecognize a user's voice. A command corresponding to the recognizedvoice, e.g. a control command, is transmitted to a correspondingapplication to perform the operation. If the application correspondingto the control command is one of the native applications 171 and 172,the display apparatus 100 performs an operation according to the controlcommand by the application. If the application corresponding to thecontrol command is a dev. application, the control command istransmitted to the external apparatuses 301, 302 and 303 including thedev. Application. The external apparatuses 301, 302 and 303 may performan operation according to the control command by the application.

Referring to FIG. 2, a plurality of the voice acquiring apparatus isprovided, e.g. a mobile phone 200 and a remote controller 300. Theremote controller 300 may act as both the voice acquiring apparatus andthe external apparatus. The mobile phone 200 may be a smart phone havinga voice acquiring function.

The remote controller 300 may transmit a preset command (controlcommand) to a corresponding apparatus by a user's manipulation. Theremote controller 300 may be set to transmit a command to the displayapparatus 100 or to the external apparatus, and may be implemented as anintegrated remote controller transmitting the command to a plurality ofapparatuses. The remote controller 300 may include a TV remotecontroller and/or an air conditioner remote controller.

The voice acquiring apparatus may be implemented as various apparatuseswhich receive a user's voice, e.g., a mobile handset, a microphonetransmitter, etc.

As shown in FIG. 2, a plurality of the external apparatuses is provided,e.g. the remote controller 300 and the air conditioner 400. As describedabove, the remote controller 300 may act as both the voice acquiringapparatus and the external apparatus.

Although FIG. 2 illustrates the external apparatuses of the remotecontroller 300 and the air conditioner 400, the exemplary embodiment isnot limited thereto. For example, the external apparatuses may beimplemented as other various electronic devices which perform wirelesscommunication, e.g., as a home theater, a radio, VCR, DVD, a washingmachine, a refrigerator, a robot vacuum cleaner, etc. If the externalapparatus includes a voice acquiring unit such as a microphone, theexternal apparatus may also act as a voice acquiring apparatus.

The external apparatuses according to the exemplary embodiment includeapplications 372 and 472 respectively to perform functions. Theapplications 372 and 472 store in advance candidate instruction words,and manage the candidate instruction words. The candidate instructionwords may be transmitted to the display apparatus 100 for voicerecognition.

The external apparatuses, i.e. the remote controller 300 and the airconditioner 400, may perform operations corresponding to the controlcommand transmitted by the display apparatus 100 according to a resultof the voice recognition.

Hereinafter, referring to FIG. 2, each element of the voice recognitionsystem will be described in detail.

The display apparatus 100 may include an image receiver 110 whichreceives an image signal, an image processor 120 which processes theimage signal received from the image receiver 110, a display unit 130which displays an image based on the image signal processed by the imageprocessor 120, a first voice acquiring unit 140 which receives a user'svoice, a first voice converter 150 which converts the received voiceinto an electric voice signal, a first communication unit 160 whichcommunicates with the external apparatus, a first storage unit 170 whichstores various data, and a first controller 180 which controls thedisplay apparatus 100.

The image receiver 110 receives an image signal and transmits the imagesignal to the image processor 120. For example, the image receiver 110may receive a radio frequency (RF) signal in a wireless manner from abroadcasting station (not shown), or receive image signals in a wiredmanner according to standards such as composite video, component video,super video, SCART (Syndicat des Constructeurs d'AppareilsRadiorécepteurs et Téléviseurs,—Radio and Television ReceiverManufacturers' Association), high definition multimedia interface(HDMI), etc. If the image signal includes a broadcasting signal, theimage receiver 110 includes a tuner to tune the broadcasting signal bychannel.

The image signal may be received from the external apparatus, e.g., aPC, an AV device, a smart phone, a smart pad, etc. The image signal maybe data transmitted through a network such as the Internet. In thiscase, the display apparatus 100 may perform a network communicationthrough the first communication unit 160, and may include an additionalnetwork communication unit. Alternatively, the image signal may be datastored in the first storage unit 170, e.g., a flash memory, hard discdrive (HDD), etc. The first storage unit 170 may be providedwithin/outside the display apparatus 100. If the first storage unit 170is provided outside the display apparatus 100, the display apparatus 100may include a connector (not shown) to which the first storage unit 170is connected.

The image processor 120 performs various image processing operationswith respect the image signal, and outputs the processed image signal tothe display unit 130.

The image processing operations of the image processor 120 may include,but not limited thereto, a decoding operation corresponding to variousimage formats, a de-interlacing operation, a frame refresh rateconversion, a scaling operation, a noise reduction operation forimproving an image quality, a detail enhancement operation, a linescanning operation, etc. The image processor 120 may be implemented asindividual groups which independently perform the foregoing operations,or as a system-on-chip (SoC) which performs integrated functions.

The display unit 130 displays an image based on the image signalprocessed by the image processor 120. The display unit 130 may include,but is not limited thereto, a liquid crystal display (LCD), a plasmadisplay panel (PDP), a light emitting diode (LED), an organic lightemitting diode (OLED), a surface-conduction electron-emitter, a carbonnano-tube, a nano-crystal, etc.

The display unit 130 may include additional elements depending on itsembodiment type. For example, the display unit 130 as an LCD typeincludes an LCD panel (not shown), a backlight unit (not shown) emittinglight to the LCD panel, and a panel driving substrate (not shown) todrive the LCD panel.

The display unit 130 may display voice recognition results asinformation on the recognized voice. The voice recognition results maybe displayed in various forms such as texts, graphics, icons, etc. Textsinclude characters and numbers. The display unit 130 may further displaycandidate instruction words according to the voice recognition resultsand application information. This will be described in more detail laterwith reference to FIG. 4.

A user may check whether the voice has been correctly recognized, basedon the voice recognition results displayed on the display unit 130. Theuser may manipulate a user input unit 330 in the remote controller 300to select an instruction word corresponding to a user's voice from thedisplayed candidate instruction words, or may select and checkinformation related to the voice recognition results.

The first voice acquiring unit 140 receives a user's voice, and may beimplemented as a microphone.

The first voice converter 150 converts the voice input by the firstvoice acquiring unit 140 into an electric voice signal. The convertedvoice signal may be in a pulse code modulation (PCM) or in a compressedaudio waveform. The first voice converter 150 may be implemented as anA/D converter which converts the user's voice into a digital form.

If the first voice acquiring unit 140 is a digital microphone, it doesnot require an additional A/D conversion. In this case, the first voiceacquiring unit 140 may include the first voice converter 150.

The first communication unit 160 communicates with the voice acquiringapparatus and external apparatus, i.e., with the mobile phone 200, theremote controller 300, and the air conditioner 400. The firstcommunication unit 160 may perform a wireless communication, whichincludes at least one of infrared communication, RF, Zigbee, andBluetooth.

The first storage unit 170 stores data by a control of the firstcontroller 180. The first storage unit 170 is implemented as anon-volatile storage medium such as a flash memory, a hard disc drive(HDD), etc. The first storage unit 170 is accessed by the firstcontroller 180, which reads/writes/modifies/deletes/updates data.

The data stored in the first storage unit 170 includes, e.g., anoperating system (OS) for driving the display apparatus 100, variousapplications executed on the OS, image data, and additional data, etc.

The first storage unit 170 may store various data to recognize a user'svoice. For example, the first storage unit 170 may store an instructionword table 171 including candidate instruction words (hereinafter, to bealso called candidate instruction word group), as recognized voiceinformation corresponding to the received voice signal. In theinstruction word table 171, candidate instruction words may be managedby a corresponding application.

The first storage unit 170 may further store at least one application,e.g., first and second applications 172 and 173 to perform functions ofthe display apparatus 100. The first and second applications 172 and 173are driven by a control of the first controller 180 (to be describedlater), and perform various functions of the display apparatus 100.Although FIG. 2 illustrates the display apparatus 100 in which twoapplications 172 and 173 are installed, the exemplary embodiments arenot limited thereto. That is, three or more applications may beinstalled in the display apparatus 100.

The first and second applications 172 and 173 may manage candidateinstruction words corresponding to performed functions. The candidateinstruction words which are managed by the first and second applications172 and 173 may be registered with/deleted from the instruction wordtable 171.

If the candidate instruction words are registered with the instructionword table 171, the voice recognition engine 181 performs voicerecognition by using the candidate instruction words in the instructionword table 171.

The candidate instruction words which may be registered with/deletedfrom the instruction word table 171 may include candidate instructionwords managed by a third application 372 of the remote controller 300(to be described later) and candidate instruction words managed by afourth application 472 of the air conditioner 400.

The first controller 180 controls various elements of the displayapparatus 100. For example, the first controller 180 controls the imageprocessor 120 to process an image signal, and performs a controloperation in response to a command from the remote controller 300 tocontrol overall operations of the display apparatus 100.

The first controller 180 may be implemented, e.g., as a centralprocessing unit (CPU) combined with software.

The first controller 180 may include the voice recognition engine 181 torecognize a user's voice. A voice recognition function of the voicerecognition engine 181 may be performed by using a known voicerecognition algorithm. For example, the voice recognition engine 181extracts voice characteristic vector of the voice signal, and comparesthe extracted voice characteristic vector with the candidate instructionwords stored in the instruction word table 171 of the first storage unit170 to recognize voice. If there is no candidate instruction wordsstored in the instruction word table 171 which is identical to the voicecharacteristic vector, the voice recognition engine 181 may recognizethe voice by adjusting the voice recognition results with the mostsimilar instruction word. If there is a plurality of similar candidateinstruction words, the first controller 180 may display the plurality ofcandidate instruction words on the display unit 130 for the user toselect one of the plurality of candidate instruction words.

While not restricted thereto, the voice recognition engine 181 accordingto the exemplary embodiment is implemented as an embedded voicerecognition engine 181 provided in the CPU. For example, the voicerecognition engine 181 may be implemented as an apparatus provided inthe display apparatus 100 separately from the CPU, i.e., implemented asan additional chip such as a microcomputer.

While not restricted thereto, the exemplary embodiment includes thevoice recognition engine 181 which is provided in a server that isseparated from the display apparatus 100 (hereinafter, to be called acloud server (not shown)). The cloud server communicates with thedisplay apparatus 100 through a network such as the Internet. Thenetwork may be a wired or wireless network. In this case, the voicerecognition engine 181 may be implemented as an embedded voicerecognition engine provided in a CPU of the cloud server, or as anapparatus provided in the cloud server separately from the CPU, i.e., asan additional chip such as a microcomputer.

The first controller 180 may perform an operation corresponding to arecognition result of the voice recognition engine 181. For example, ifthe display apparatus 100 is a TV and a user is watching a movie ornews, the voice recognition engine 181 may recognize voice such as“volume up”, “volume down”, “louder”, “lower”, etc., and the firstcontroller 180 may adjust volume of the movie or news according to thevoice.

If the voice recognition engine 181 recognizes the voice for controllingthe external apparatus such as the remote controller 300 or the airconditioner 400, the first controller 180 may control the firstcommunication unit 160 to transmit a control command to the externalapparatus corresponding to the recognized voice. For example, if thevoice recognition engine 181 recognizes a voice of “raise temperature”,the first controller 180 may recognize that the voice is for controllingthe air conditioner 400, and control the first communication unit 160 totransmit a command to the air conditioner 400 to raise the temperatureof the air conditioner 400.

Hereinafter, a detailed configuration of the mobile phone 200 will bedescribed.

As shown in FIG. 2, the mobile phone 200 may include a second voiceacquiring unit 240 which receives a user's voice, a second voiceconverter 250 which converts the received voice into an electric voicesignal, a second communication unit 260 which communicates with theoutside, a second storage unit 270 which stores data, and a secondcontroller 280 which controls the mobile phone 200.

The second voice acquiring unit 240 which receives the user's voice maybe implemented as a microphone. The second voice converter 250 convertsthe received voice into an electric voice signal. The converted voicesignal may be in a pulse code modulation (PCM) or in a compressed audiowaveform. The second voice converter 250 may be implemented as an A/Dconverter which converts the user's input voice into a digital form.

If the second voice acquiring unit 240 is a digital microphone, it doesnot require an additional A/D conversion. In this case, the second voiceacquiring unit 240 may include the second voice converter 240.

The second communication unit 260 which communicates with the displayapparatus 100 may perform a wired or wireless communication. Thewireless communication may include at least one of RF, Zigbee, andBluetooth.

The second communication unit 260 may transmit the voice signal from thesecond voice converter 250 to the display apparatus 100.

The second storage unit 270 may store data by a control of the secondcontroller 280. The second storage unit 270 is implemented as anon-volatile storage medium such as a flash memory. The second storageunit 270 is accessed by the second controller 280, whichreads/writes/modifies/deletes/updates data.

The data stored in the second storage unit 270 may include, e.g., an OSfor driving the mobile phone 200, various applications executed on theOS, image data, additional data, etc.

The second controller 280 may control various elements of the mobilephone 200. For example, the second controller 280 may generate a commandin response to a user's manipulation, perform an operation correspondingto the generated command, and display a result on a display unit (notshown).

The second controller 280 may be implemented as a micro controller unit(MCU) combined with software.

If the user's voice is input through the second voice acquiring unit240, the second controller 280 controls the second voice converter 250to convert the user's voice into an electric voice signal and controlsthe second communication unit 260 to transmit the converted voice signalto the display apparatus 100.

Hereinafter, a detailed configuration of the remote controller 300 willbe described.

As shown in FIG. 2, the remote controller 300 as a voice acquiringapparatus and an external apparatus, may include a user input unit 330which receives a user's manipulation, a third voice acquiring unit 340which receives a user's voice, a third voice converter 350 whichconverts the received voice into an electric voice signal, a thirdcommunication unit 360 which communicates with the outside, a thirdstorage unit 370 which stores data, and a third controller 280 whichcontrols the remote controller 300.

The user input unit 330 may transmit various control commands orinformation to the third controller 380 by a user's manipulation andinput. The user input unit 330 may be implemented as a menu key, anumber key, etc. provided in the remote controller 300. If the remotecontroller 300 is a TV remote controller, the user input unit 330 mayinclude a touch sensor to receive a user's touch input, and/or a motionsensor to sense a motion of the remote controller 300.

The third voice acquiring unit 340 which receives the user's voice maybe implemented as a microphone.

The third voice converter 350 converts the voice input by the thirdvoice acquiring unit 340 into an electric voice signal. The convertedvoice signal may be in a pulse code modulation (PCM) or in a compressedaudio waveform. The third voice converter 350 may be implemented as anA/D converter which converts a user's input voice into a digital form.

If the third voice acquiring unit 340 is a digital microphone, it doesnot require an additional A/D conversion. In this case, the third voiceacquiring unit 340 may include the third voice converter 350.

The third communication unit 360 communicates with the display apparatus100. The third communication unit 360 performs wireless communication.The wireless communication includes at least one of RF, Zigbee, andBluetooth.

The third communication unit 360 transmits the voice signal from thethird voice converter 350 and the candidate instruction words managed bythe third application 372 of the third storage unit 370 (to be describedlater) to the display apparatus 100.

The third storage unit 370 which stores data by a control of the thirdcontroller 380 may be implemented as a non-volatile storage medium suchas a flash memory, etc. The third storage unit 370 is accessed by thethird controller 380, which reads/writes/modifies/deletes/updates data.

The data stored in the third storage unit 370 may include, e.g., an OSfor driving the remote controller 300, various applications executed onthe OS, image data, and additional data, etc.

The third storage unit 370 may further store at least one application,e.g., the third application 372 to perform functions of the remotecontroller 300. The third application 372 is driven by a control of thethird controller 380 (to be described later), and performs variousfunctions of the remote controller 300. Here, the third application 372and a fourth application 472 (to be described later) will be called dev.applications to be distinguished from the native applications 172 and173 of the display apparatus 100.

Although FIG. 2 illustrates the remote controller 300 in which oneapplication 372 is installed, the exemplary embodiment is not limitedthereto. That is, two or more applications may be installed in theremote controller 300.

The third application 372 may manage candidate instruction wordscorresponding to performed functions. The candidate instruction wordsmanaged by the third application 372 may be registered with/deleted fromthe instruction word table 171 of the display apparatus 100.

The third controller 380 may control various elements of the remotecontroller 300. For example, the third controller 380 may generate acommand in response to the user's manipulation of the user input unit330, and control the third communication unit 360 to transmit thegenerated command to the display apparatus 100.

The third controller 380 may be implemented as an MCU combined withsoftware.

If a user's voice is input through the third voice acquiring unit 340,the third controller 380 controls the third voice converter 350 toconvert the user's voice into an electric voice signal and controls thethird communication unit 360 to transmit the converted voice signal tothe display apparatus 100.

While communicating with the display apparatus 100, the third controller380 may transmit the candidate instruction words managed by the thirdapplication 372 of the third storage unit 370 to the display apparatus100. The transmitted candidate instruction words are registered with theinstruction word table 171 of the display apparatus 100 and used torecognize voice by the voice recognition engine 181.

If a control command is transmitted by the display apparatus 100 as aresult of the voice recognition, the third controller 380 may receivethe control command through the third communication unit 360 and performan operation corresponding to the received control command.

Hereinafter, a detailed configuration of the air conditioner 400 will bedescribed.

As shown in FIG. 2, the air conditioner 400 as an external apparatus mayinclude a fourth communication unit 460 which communicates with theoutside, a fourth storage unit 470 which stores data, and a fourthcontroller 480 which controls the air conditioner 400.

The fourth communication unit 460 which communicates with the displayapparatus 100 may perform wireless communication, which includes atleast one of RF, Zigbee, and Bluetooth.

The fourth communication unit 460 transmits candidate instruction wordsmanaged by the fourth application 472 of the fourth storage unit 470 (tobe described later) to the display apparatus 100.

The fourth storage unit 470 which stores data by a control of the fourthcontroller 480 may be implemented as a non-volatile storage medium suchas a flash memory. The fourth storage unit 470 is accessed by the fourthcontroller 480, which reads/writes/modifies/deletes/updates data.

The data stored in the fourth storage unit 470 may include, e.g., an OSfor driving the air conditioner 400, various applications executed onthe OS, image data, and additional data, etc.

The fourth storage unit 470 may further store at least one application(dev. application), e.g., a fourth application 472 to perform functionsof the air conditioner 400. The fourth application 472 is driven by acontrol of the fourth controller 480 (to be described later), andperforms various functions of the air conditioner 400.

Although FIG. 2 illustrates the air conditioner 400 in which oneapplication 472 is installed, the exemplary embodiment is not limitedthereto. That is, two or more applications may be installed in the airconditioner 400.

The fourth application 472 manages candidate instruction wordscorresponding to performed functions. The candidate instruction wordsmanaged by the fourth application 472 may be registered with/deletedfrom the instruction word table 171 of the display apparatus 100.

The fourth controller 480 controls various elements of the airconditioner 400. For example, the fourth controller 480 may receive acontrol command in response to the user's manipulation of a remotecontroller of the air conditioner 400, and perform a control operation,e.g., adjust temperature, according to the generated control command.

The fourth controller 480 may be implemented as an MCU combined withsoftware.

While communicating with the display apparatus 100, the fourthcontroller 480 may transmit the candidate instruction words managed bythe fourth application 472 of the fourth storage unit 470 to the displayapparatus 100. The transmitted candidate instruction words areregistered with the instruction word table 171 of the display apparatus100 and used to recognize voice by the voice recognition engine 181.

If a control command is transmitted by the display apparatus 100 as aresult of the voice recognition, the fourth controller 480 may receivethe control command through the fourth communication unit 460 andperform an operation corresponding to the received control command.

If a voice input is detected to at least one of the plurality of voiceacquiring units 140, 240 and 340, the first controller 180 of thedisplay apparatus 100 as the main apparatus of the voice recognitionsystem according to the exemplary embodiment controls the firstcommunication unit 140 to receive the voice signal from the voiceacquiring unit to which the voice input has been detected. The firstcontroller 180 receives the candidate instruction words from at leastone of the dev. applications 372 and 472 of the plurality of externalapparatuses 300 and 400 through the first communication unit 140 or fromthe native applications 172 and 173 of the display apparatus 100 torecognize the received voice signal, and registers the transmittedcandidate instruction words with the instruction word table 171 of thefirst storage unit 170. The voice recognition engine 181 compares thecandidate instruction words registered with the instruction word table171 with the voice signal and recognizes the voice.

The display apparatus 100 may detect voice input through variousapparatuses which input a user's voice. The display apparatus 100 mayuse the candidate instruction words which are provided by theapplication to recognize voice, and may dynamically register/delete thecandidate instruction words for voice recognition. Thus, the candidateinstruction words of the display apparatus 100 may be prevented frombeing increased unnecessarily.

The display apparatus 100 may receive the voice recognition from thevoice acquiring unit 140, receive candidate instruction words from theat least one of the native applications, 172 and 173 or from the dev.Applications 372 and 472, and perform voice recognition with the voicerecognition engine 181.

Hereinafter, the voice recognition system which recognizes voiceaccording to the exemplary embodiment will be described in more detailwith reference to FIGS. 3 and 4.

FIG. 3 illustrates an example of performing voice recognition, and FIG.4 illustrates an example of a screen that displays voice recognitionresults in FIG. 3.

As shown in FIG. 3, the display apparatus 100 may have registeredcandidate instruction words, which are provided by at least oneapplication (including native application and dev. Application) andstored in the instruction word table 171.

For example, instruction words A and B are transmitted by the firstapplication 172 (i.e., native application) to the instruction word table171 (501), and are stored in the instruction word table 171 (502). Thevoice recognition engine 181 registers the instruction words A and Bstored in the instruction word table 171 as candidate instruction words(504).

Instruction words C and D are transmitted by the third application 372(i.e., dev. Application) to the instruction word table 171 (505), andare stored in the instruction word table 171 (506). The voicerecognition engine 181 registers the instruction words C and D stored inthe instruction word table 171 as the candidate instruction words (508).

Accordingly, the voice recognition engine 181 registers the instructionwords A, B, C and D which are transmitted by the first and thirdapplications 172 and 372 as candidate instruction words.

For example, when the instruction words A, B, C and D are registered asthe candidate instruction words, an input of a voice A to the secondvoice acquiring unit 240 separated from the display apparatus 100 may bedetected. The detected voice A is converted into a voice signal by thesecond voice converter 250, and transmitted to the voice recognitionengine 181 through the second and first communication units 260 and 160(509).

The voice recognition engine 181 compares the voice signal of the voiceA with the registered candidate instruction words A, B, C and D, decidesidentical or similar commands, and recognizes the voice A (510).

The first controller 180 may transmit the recognition results to thedisplay unit 130 (511), and the display unit 130 may display the voicerecognition results as in FIG. 4.

As shown in FIG. 4, the display unit 130 may display a UI showing avoice recognition result “A” 60, and the candidate instruction words A,B, C and D 61, 62, 63 and 64 according to the voice recognition results.The display unit 130 may further display a UI showing applicationinformation (the first application) 65 managing the instruction word Aaccording to the voice recognition results.

Through the UI displayed on a screen, a user may check the voicerecognition results and the candidate instruction words. If the voicerecognition results do not meet his/her intention of speaking, the usermay select one of the candidate instruction words. The user may obtaininformation of an apparatus related to the voice recognition resultsthrough the application information.

The first controller 180 transmits a control command to the firstapplication 172 according to the voice recognition results as in FIG.3(512). The first application 172 performs a control operation accordingto the recognized voice A by a control of the first controller 180. Forexample, if the voice A is “volume down”, the volume of the displayapparatus 100 is lowered.

As described in FIGS. 3 and 4, the display apparatus 100 may registerthe instruction words (e.g., A, B, C and D) of some applications inadvance, and if a user's voice is detected, may recognize the voice,display the voice recognition results, and perform a correspondingcontrol operation based on the registered candidate instruction words.

Although FIGS. 3 and 4 illustrate the instruction words of the firstapplication 172 and the third application 372 which are registered asthe candidate instruction words, and a user's voice is input through thesecond voice acquiring unit 240, the exemplary embodiment is not limitedthereto. For example, the instruction words may be transmitted by othervarious native and dev. applications to register/delete the candidateinstruction words, and voice is input through various voice acquiringunits.

Hereinafter, a voice recognition system which performs voice recognitionaccording to another exemplary embodiment will be described in detailwith reference to FIG. 5.

FIG. 5 illustrates an example of performing voice recognition accordingto another exemplary embodiment.

As shown in FIG. 5, an input of a voice E to the third voice acquiringunit 340 that is separated from the display apparatus 100 may bedetected. The detected voice E is converted into a voice signal by thethird voice converter 350, and transmitted to the voice recognitionengine 181 through the third and first communication units 360 and 160(701).

The display apparatus 100 may have registered candidate instructionwords. For example, instruction words E and F are transmitted by thethird application 372 to the instruction word table 171 (702), andstored in the instruction word table 171 (703). The voice recognitionengine 181 registers the instruction words E and F stored in theinstruction word table 171 as candidate instruction words (705).

That is, the instruction words E and F which are transmitted by thethird application 372 are registered as the candidate instruction wordswith the voice recognition engine 181.

When the instruction words E and F are registered as the candidateinstruction words, the voice recognition engine 181 compares the voicesignal of the voice E with the registered candidate instruction words Eand F, decides identical or similar instruction words, and recognizesthe voice E (706).

The first controller 180 transmits the recognition results to thedisplay unit 130 (707), and the display unit 130 may display the voicerecognition results.

The first controller 180 transmits a control command to the thirdapplication 372 according to the voice recognition results (708). Thethird application 372 performs a control operation according to therecognized voice E by a control of the third controller 380. If thecontrol command transmitted according to the voice recognition resultsis a command for controlling the display apparatus 100, the controlcommand may be transmitted to the first application 172 or the secondapplication 173.

As shown in FIG. 5, if a user's voice is detected, the display apparatus100 may register the instruction words (e.g. E and F) of the applicationcorresponding to the apparatus to which the voice has been input, as thecandidate instruction words, recognize the voice based on the registeredcandidate instruction words, display the voice recognition results, andperform a corresponding control operation.

Although FIG. 5 illustrates that a user's voice is input through thethird voice acquiring unit 340, and the instruction words of the thirdapplication 372 are registered as the candidate instruction words, theexemplary embodiment is not limited thereto. For example, the voice maybe input through various voice acquiring units, and instruction wordsmay be transmitted by various native and dev. Applications toregister/delete the candidate instruction words.

Hereinafter, a voice recognition method of the voice recognition systemaccording to an exemplary embodiment will be described with accompanyingdrawings.

FIG. 6 is a flowchart showing a voice recognition method of the voicerecognition system according to the exemplary embodiment.

As shown in FIG. 6, the voice recognition system may detect the voiceinput to at least one of the plurality of voice acquiring units 140, 240and 340 (operation S810). The detected voice is converted into anelectric voice signal by the voice converters 150, 250 and 350.

The first controller 180 receives the voice signal (operation S820). Ifthe voice input to the external voice acquiring units 240 and 340 isdetected, the voice signal may be received through the firstcommunication unit 160.

The voice recognition engine 181 registers the candidate instructionwords to recognize the voice based on the voice signal (operation S830).The registered candidate instruction words may be the words stored inadvance in the instruction word table 171, or received through thenative or dev. applications 172, 173, 372 and 472, and stored in theinstruction word table 171.

The voice recognition engine 181 recognizes the user's voice based onthe stored candidate instruction words (operation S840).

The first controller 180 displays the voice recognition results on thedisplay unit 130. The display unit 130 may display the voice recognitionresults for the voice signal, the candidate instruction words accordingto the voice recognition results, and the application information.

The first controller 180 generates a control command according to thevoice recognition results and transmits the control command to theapplication (operation S860). Accordingly, the operation may beperformed by the generated control command.

FIG. 7 is a flowchart showing details of a process of detecting a voiceinput in FIG. 6.

As shown in FIG. 7, the display apparatus 100 as the main apparatus mayoperate in a voice input mode in which a user's voice input is possible(operation S811). In the voice input mode, the voice may be inputthrough various voice acquiring units 140, 240 and 340.

For example, the first controller 180 may detect a speaking of a wakeupkeyword from one of the plurality of voice acquiring units 140, 240 and340 (operation S812). The wakeup keyword enables the voice input througha particular voice acquiring unit, and may be set in advance. Forexample, the first voice acquiring unit 140 of the display apparatus 100may set voice related to a control of TV such as channels and volume,etc., as the wakeup keyword. The second voice acquiring unit 240 of themobile phone 200 may set voice related to call, contact information,etc., as the wakeup keyword.

Alternatively, if a trigger signal is input by one of the plurality ofvoice acquiring units 140, 240 and 340 as a result of a manipulation ofa predetermined button (voice input button), the first controller 180may detect the voice input by the voice acquiring unit (operation S813).For example, if a user manipulates the voice input button provided in aparticular voice acquiring apparatus, the voice input to the voice inputunit of the particular voice acquiring apparatus is detected.

According to the detection, the first controller 180 enables one of theplurality of voice acquiring units 140, 240 and 340 to which the voiceis input (operation S812). As one of the voice acquiring units isenabled, detection of unnecessary voice and malfunction may beprevented.

The voice signal is transmitted by the enabled voice acquiring unit tothe voice recognition engine 181 to perform voice recognition.

FIG. 8 is a flowchart showing details of a process of performing voicerecognition in FIG. 6.

As shown in FIG. 8, the voice recognition engine 181 may receivecandidate instruction words from at least one of the plurality ofapplications 172, 173, 372 and 472, and register the candidateinstruction words (operation S830).

The voice recognition engine 181 may determine whether the registeredcandidate instruction words are identical to/similar to the receivedvoice signal (operation S841).

If it is determined that there are identical or similar candidateinstruction words, the voice recognition engine 181 decides theidentical/similar instruction words and performs voice recognition, andthe first controller 180 displays the voice recognition results on thedisplay unit 130 (operation S850).

If it is determined that there is no identical or similar candidateinstruction words, the voice recognition engine 181 may decide whetherto receive and register the candidate instruction words of otherapplications (operation S842). The first controller 180 may receive andregister the candidate instruction words of other applications accordingto a user's selection or an input, and may receive and register thecandidate instruction words of the plurality of applications in a presetorder. In consideration of the capacity of the first storage unit 170 ofthe display apparatus 100, the previous registered candidate instructionwords may be selectively deleted.

That is, if no candidate instruction word is identical to/similar to theregistered candidate instruction words, the operations S842 and S841 aresequentially performed to perform voice recognition.

If it is decided at operation S842 not to receive and register thecandidate instruction words of other applications, the voice recognitionengine 181 suspends the voice recognition, and the first controller 180may display the failure to voice recognition on the display unit 130.

As the main apparatus according to the exemplary embodiment detectsvoices input by various apparatuses which receive a user's voice,various voice acquiring apparatuses may be used, and linked services maybe provided by the voice acquiring apparatuses.

The candidate instruction words for voice recognition are transmitted bythe plurality of applications, and registered/deleted. Thus, thecandidate instruction words of the main apparatus are not unnecessarilyincreased, delay in the processing speed or deterioration of the rate ofrecognition may be prevented, and the overall efficiency of the voicerecognition system may be improved.

The voice recognition results, the candidate instruction words,application information as well as information on various voiceacquiring apparatuses, and apparatuses providing candidate instructionwords used for voice recognition are recognized more easily by a userand user's convenience is enhanced.

While not restricted thereto, the exemplary embodiments can be writtenas computer programs and can be implemented in general-use digitalcomputers that execute the programs using a computer readable recordingmedium. Examples of the computer readable recording medium includemagnetic storage media (e.g., ROM, floppy disks, hard disks, etc.) andoptical recording media (e.g., CD-ROMs, or DVDs). Also, the exemplaryembodiments may be written as computer programs transmitted over acomputer-readable transmission medium, such as a carrier wave, andreceived and implemented in general-use digital computers that executethe programs. Moreover, while not required in all aspects, one or moreunits of the apparatus can include a processor or microprocessorexecuting a computer program stored in a computer-readable medium, suchas a local storage.

Although a few exemplary embodiments have been shown and described, itwill be appreciated by those skilled in the art that changes may be madein these exemplary embodiments without departing from the principles andspirit of the inventive concept, the range of which is defined in theappended claims and their equivalents.

What is claimed is:
 1. A display apparatus comprising: a display; astorage; a communicator comprising circuitry configured to communicatewith an external apparatus; and a processor configured to: obtain aplurality of instruction words corresponding to at least oneapplication; control the storage to store the obtained plurality ofinstruction words; register, as candidate instruction words, in a voicerecognition program, the plurality of instruction words stored in thestorage; based on a wakeup keyword being received from one of a firstexternal voice acquiring unit and a second external voice acquiringunit, enable reception of a voice signal from the one of the firstexternal voice acquiring unit and the second external voice acquiringunit from which the wakeup keyword is received, and disable reception ofa voice signal from remaining one of the first external voice acquiringunit and the second external voice acquiring unit; based on a voicesignal being received from the enabled one of the first external voiceacquiring unit and the second external voice acquiring unit, perform avoice recognition process using the voice recognition program, torecognize an instruction word corresponding to the received voicesignal, among the registered candidate instruction words; identify oneof the at least one application to perform the recognized instructionword; and perform a control operation for the identified one of the atleast one application to operate based on the recognized instructionword.
 2. The display apparatus according to claim 1, wherein the atleast one application is stored in the external apparatus.
 3. Thedisplay apparatus according to claim 1, wherein the at least oneapplication comprises the plurality of instruction words.
 4. The displayapparatus according to claim 1, wherein the plurality of instructionwords corresponding to the at least one application is obtained from theexternal apparatus corresponding to the at least one application or froma server.
 5. The display apparatus according to claim 1, wherein theprocessor is further configured to, based on the voice recognitionprocess failing, identify another application, based on a user input. 6.The display apparatus according to claim 1, wherein the processor isfurther configured to, based on a memory state of the storage, controlthe storage to delete the stored plurality of instruction words.
 7. Thedisplay apparatus according to claim 1, wherein the wakeup keyword isinput by a manipulation of a predetermined button that is provided inthe one of the first external voice acquiring unit and the secondexternal voice acquiring unit.
 8. The display apparatus according toclaim 1, wherein the processor is further configured to, based on thevoice signal being received from the enabled one of the first externalvoice acquiring unit and the second external voice acquiring unit:identify, using the voice recognition program, whether at least oneamong the registered candidate instruction words corresponding to afirst one of the at least one application is identical or similar to theinstruction word corresponding to the received voice signal; based onthe at least one among the registered candidate instruction wordscorresponding to the first one of the at least one application beingidentified to be identical or similar to the instruction word, controlthe display to display a voice recognition result indicating that the atleast one among the registered candidate instruction words correspondingto the first one of the at least one application is identified to beidentical or similar to the instruction word; and based on the at leastone among the registered candidate instruction words corresponding tothe first one of the at least one application being identified to be notidentical or similar to the instruction word, identify, using the voicerecognition program, whether at least one among the registered candidateinstruction words corresponding to a second one of the at least oneapplication is identical or similar to the instruction wordcorresponding to the received voice signal.
 9. A voice recognitionmethod of a display apparatus, the method comprising: obtaining aplurality of instruction words corresponding to at least oneapplication; storing the obtained plurality of instruction words;registering, as candidate instruction words, in a voice recognitionprogram, the stored plurality of instruction words; based on a wakeupkeyword being received from one of a first external voice acquiring unitand a second external voice acquiring unit, enabling reception of avoice signal from the one of the first external voice acquiring unit andthe second external voice acquiring unit from which the wakeup keywordis received, and disabling reception of a voice signal from remainingone of the first external voice acquiring unit and the second externalvoice acquiring unit; based on a voice signal being received from theenabled one of the first external voice acquiring unit and the secondexternal voice acquiring unit, performing a voice recognition processusing the voice recognition program, to recognize an instruction wordcorresponding to the received voice signal, among the registeredcandidate instruction words; identifying one of the at least oneapplication to perform the recognized instruction word; and performing acontrol operation for the identified one of the at least one applicationto operate based on the recognized instruction word.
 10. The methodaccording to claim 9, wherein the at least one application is stored inan external apparatus.
 11. The method according to claim 9, wherein theat least one application comprises the plurality of instruction words.12. The method according to claim 9, wherein the plurality ofinstruction words corresponding to the at least one application isobtained from an external apparatus corresponding to the at least oneapplication or from a server.
 13. The method according to claim 9,further comprising, based on the voice recognition process failing,identifying another application, based on a user input.
 14. The methodaccording to claim 9, further comprising, based on a memory state of astorage storing the plurality of instruction words, deleting the storedplurality of instruction words.
 15. A display apparatus comprising: adisplay; a communicator comprising circuitry configured to communicatewith a plurality of external apparatus; and a processor configured to:obtain a plurality of instruction words corresponding to at least oneapplication; register, as candidate instruction words, in a voicerecognition program, the plurality of instruction words; based on awakeup keyword being received from a voice acquiring unit, enablereception of a voice signal from the voice acquiring unit from which thewakeup keyword is received, and disable reception of a voice signalwhich is successively received from another voice acquiring unit; basedon the received voice signal from the voice acquiring unit, perform avoice recognition process using the voice recognition program, torecognize an instruction word corresponding to the received voicesignal, among the registered candidate instruction words; identify oneof the at least one application to perform the recognized instructionword; perform a control operation for the identified one of the at leastone application to operate based on the recognized instruction word; anddisplay a voice recognition result indicating that the at least oneamong the registered candidate instruction words corresponding to thefirst one of the at least one application.
 16. The display apparatusaccording to claim 15, wherein the wakeup keyword is input by amanipulation of a predetermined button that is provided in the voiceacquiring unit.
 17. The display apparatus according to claim 15, whereinthe processor is further configured to, based on the voice signal beingreceived from the voice acquiring unit, identify, using the voicerecognition program, whether at least one among the registered candidateinstruction words corresponding to a first one of the at least oneapplication is identical or similar to the instruction wordcorresponding to the received voice signal.
 18. The display apparatusaccording to claim 15, wherein the processor is further configured to,based on the voice signal being received from the voice acquiring unit,and based on the at least one among the registered candidate instructionwords corresponding to the first one of the at least one applicationbeing identified to be not identical or similar to the instruction word,identify, using the voice recognition program, whether at least oneamong the registered candidate instruction words corresponding to asecond one of the at least one application is identical or similar tothe instruction word corresponding to the received voice signal.